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1.  Introduction. 

Whitt  (1979)  shows  tiiat  in  a  Bayesian  analysis,  under  certain  general 
conditions,  the  larger  the  observations  obtained,  the  larger  stochastically 
will  be  the  appropriate  parameter  of  the  posterior  distribution.  His  inter¬ 
esting  results  are  developed  for  the  case  of  a  univariate  parameter.  In  the 
present  paper,  we  show  that  his  key  ideas  may  be  extended  to  the  multivariate 
parameter  case.  Our  basic  tool  is  multivariate  total  positivity  of  order  2, 
introduced  and  treated  by  Karlin  and  Kinott  (1980) . 

In  Section  3,  we  present  sufficient  conditions  on  the  likelihood  function 
and  on  the  prior  distribution  under  which  the  posterior  distribution  of  a 
multivariate  parameter  is  stochastically  increasing  in  the  sample  observations. 
In  Section  4,  we  show  how  these  results  may  be  applied  to  a  number  of  well 
known  multivariate  distributions,  such  as  the  normal,  the  multinomial,  the 
negative  multinomial ,  etc. 

In  a  subsequent  paper,  we  will  extend  Whitt 's  original  notion  to  show 
that  other  properties  of  the  data  are  "inherited"  stochastically  by  the 
parameter  posterior  distribution. 


2.  Preliminaries. 


In  tliis  section  we  present  definitions,  notation,  and  basic  facts  used 
throughout  the  paper. 

2.1.  Definition.  A  random  vector  X  is  said  to  be  stochastically 
increasing  in  a  random  vector  Y  if  Et$(X)|Y  *  yj  is  increasing  in  jr  for 
every  increasing  function  4  for  which  the  expectation  racists. 

k 

(A  function  *  on  a  subset  of  R  -►  R  is  said  to  be  increasing  if  it  is 
increasing  in  each  of  its  arguments) . 

Let  Xj,  X^,  ...»  X^  be  totally  ordered  spaces.  Let  X  *  *  X2  *  . ..  *  X^. 

For  every  x,  c  X,  define: 

x  v  (maxCxj,  y1),  ....  max(xk,  yk)) 

x  a  £«  (minCx^  yx),  ....  min(xk>  yk)). 

The  following  definitions  are  due  to  Karlin  and  Rinott  (1980) . 

2.2.  Definition.  Consider  a  function  f(x)  defined  on  X.  We  say  that 
f(x)  is  multivariate  totally  positive  of  order  2  if: 

f(x  v  y)f(x  a  jr)  a  f (x)  f (jr) 


for  eveTy  x,  c  X,  we  write  f  is  rtTP2 . 

The  following  is  a  multivariate  generalization  of  the  familiar  monotone 
likelihood  ratio  property. 

Ir 

2.3.  Definition.  Let  f^  and  f2  be  two  probability  densities  on  R  .  If 


f2(x  v  j^fjlxA  jr)  2  f^xJfjCjr) 


for  every  x ,  £  c  R  ,  then  we  say  that  f  2  >jp  f^ . 

2 


3 


ij 

2.4.  Theorem.  Let  and  f2  be  probability  densities  on  R  such  that: 
f2  >TP2  £1* 

V 

Let  $  be  an  increasing  function  on  R  .  Then: 

/♦OOfjCx)  s  ft(x)f2(x) . 

For  a  proof,  see  Karlin  and  Rinott  (1980). 
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3.  Theoretical  Results. 

In  the  sequel let  8  be  a  parameter  taking  values  in  the  space 
e  -  ex  X  ...  x  9^,  where  each  is  totally  ordered.  Let  X  be  an  observable 
random  vector.  Let  n(x|e)  denote  the  conditional  p.d.f.  (or  probability 
mass  function)  of  X  given  £  and  let  n(0)  be  the  prior  p.d.f.  (or  probability 
mass  function)  of  e_.  Finally,  let  n(ejx)  be  the  posterior  p.d.f.  (or 
probability  mass  function) . 

3.1.  Theorem.  Suppose 

a)  n(x|e)  *  h(x)g(x,  0),  where  g(x,  0)  is  MTP2  on  *  0»  and 

b)  n(0)  is  Mn*2  on  o.  (3.1) 

Then 

n(e|x(2))  >»p  n(e|x(1)) 

for  every  x^ ,  x^  e  Rn  such  that  x^  s  x^ . 

Proof.  The  posterior  density  is  given  by: 

n(o|x)  -  C(x)g(x,  0)11(0), 

idler e: 

C(x)  -  Lf  g(x,  OjlKeJd©!)"1 
0  ” 

Let  s  x<2> .  Define: 

f^O)  «  n(e|x(1)),  f2(0)  -  n(ejx(2)). 

Let  0,  v  «  9.  irfrite: 

fj^fjCs)  -  C(x(1))g(x(1),  yn(i)C(x(2))g(x(2),  v)n(v) 

-c(x(1))C(x(2))g(x(1),  e)g(x(2),  v)n(0)n(v) 

s  C(x^)C(x^)g(x^  a  x(2),  0  a  v)g(x^  v  yS2\ 

9  v  v)n(0  a  v)n(0  v  v), 
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since  both  g(x,  6)  and  n(e)  are  ^ttls 

*  C(x^)g(x^,  e_  a  v)n(e_  a  v)C(x^^g(x^2\  e_  v  v)n(£v  v) 


Hence 


-  ^(6  A  v)  f  2  (£  V  v). 

f2  *TP2  fl*  N 


3.2.  Corollary.  If  (3.1)  holds,  then  9^  is  stocliastically  increasing 

in  X. 

Proof.  The  result  follows  immediately  from  Theorems  2.4  and  3.1.  || 
The  conditions  required  to  apply  the  previous  Theorem  3.1  are  too 
strong  in  many  cases,  and  so  we  consider  tne  following. 

3.3.  Theorem.  Suppose: 

k 

n(x|e)  -  h(x)  n^gi(ei,  x^CCe), 


where  x4)  is  TP2  in  (e^  x^,  i  -  1,  ...»  n. 

Then: 


(3.2) 


n(©j |xx  m  Xj,  ...»  Xj  •  x j ,  ...,  xk  ■  x^)  >jp  n(flj |x^  *  x^,  ...,  Xj 


Xj »  •  •  • ,  Xk  ■  X|P 


for  every  x.  <  x,'  and  for  every  j  »  1,  2,  ...,  k. 

J  J 

Proof.  The  marginal  posterior  density  of  is 


n(e.|x)  •  h(x)g.(e.,  x.)/.../  n  g.(e.,  x.)H(e)  n  de4, 
j  j  j  j  j«i  A  1  *■  i«l  x 


where  11(e)  ■  C(e)n(e).  Define  : 


0 


Kej»  x 


a) 


k  k 

-/.../  n  g.(e.,  x.)H(e;  n  de., 

i-1  1  1  1  i-l  1 

i^j 


where  x^  ■  (x^  ....  Xj_k,  xj+j»  •••»  xk).  Thus: 

HC0J |x)  -  h(x)gj(6j,  Xj)  \  (6j,  x^). 

Let  Xj  <  xj.  Define: 

fl(®j)  *  n(9j  |X^  *  xr  •  •  •  >  Xj  -Xj<  •  •  • »  Xk  —  xk), 
f2(6j)  -  n(6j|X1  -  Xy  ....  Xj  -  xj,  xk  -  xk). 

Thus: 


h(xr  •  •  • »  xj,  xk)gj(8j,  xj) 

"  hlXj,  ....  Xj,  Xk)gjl6j,  Xj) 


is  increasing  in  6j  by  the  TP^  property  of  gj(6j,  xj).  Hence: 

f  2  >TP2  fl '  I  * 

3.4.  Remarks.  Note  that  the  result  in  Theorem  3.3  pertains  to  the 
marginal  posterior  density.  Also,  the  result  holds  irrespective  of  which 
prior  distribution  we  use.  It  follows  from  Theorems  2.4  and  3.3  that 
SXGj)^  ■  Xj,  ...»  Xj  -  Xj,  ...»  Xk  -  x^  is  increasing  in  Xj  for  every 
increasing  function  4. 

If  we  consider  the  case  where  e  is  of  dimension  two,  we  have  the 
following  result. 


■  - 1  -  — 


i&m 


3.5.  Theorem.  Suppose 


2 

nCej,  e^,  x2)  -  hCxj,  x2)  n^gi(ei,  x-KCe^  e2), 

where  g^e^,  xp  is  TP2  in  (8^  xp  for  i  -  1,  2,  and  CC-e^,  02)  is  TP2  in 

(®1»  ®2^* 

Then  for  x2  <  x|, 

*2)  >TP  x2^  * 

A  similar  result  holds  for  the  marginal  posterior  distribution  of  e2. 

Proof.  The  posterior  density  of  8j  is: 

n(e1|x1,  x2)  -  h(Xj,  x2)g1(e1,  x^f^Ce^  x^cCe^  e2)de2. 

Define: 

x2)  *  Jg2 (02»  x2)C(0j,  02)d®2‘ 

Then  +(-0^  x2)  is  TP2  in  (0^,  x2)  by  the  basic  composition  formula. 

CSee  Karlin,  1968,  pp.  98-103.]  Let  x2  <  x2-  Then: 

nCOjJxj,  x2)  h(xj_,  x2)  \  (e^  x£) 
ntSjlxj,  x^)  "  xp  f  tej*  xp  ' 

which  is  increasing  in  0j^  by  the  TP2  property  of  +(-0^  x2). 

3.6.  Remark.  Note  that  Theorem  3.5  involves  conditions  on  the  prior 
distribution.  It  follows  from  Theorems  2.4  and  3.5  that  El$(0j) (Xj  ■  Xj,  X 
is  decreasing  in  x2  for  every  increasing  function  *. 
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4.  Applications. 

In  this  section,  we  present  applications  of  the  results  of  Section  3. 
4.1.  Multivariate  Normal  Distribution.  Let  X|e_ be  normally  distributed 
with  mean  vector  e  and  a  known  covariance  matrix: 


Then,  whatever  prior  we  use,  the  conditions  of  Theorem  3.3  are  satisfied. 

Hence  from  Remark  3.4,  we  conclude  that  EC^(6j) |X^  *  x^,  . ..,  Xj  ■  x^,  . ..,  X^  ■  x^J 

is  increasing  in  Xj  for  every  increasing  function 

Consider  the  conjugate  prior  for  e_,  namely  the  multivariate  normal 

distribution  with  mean  vector  u  and  covariance  matrix  0.  Let  d1J  denote  the 

(i,  j)th  element  of  D1.  If  d^  s  0  for  every  i  +  j,  then  the  conditions  of 

Theorem  3.1  are  satisfied.  It  follows  from  Cor.  3.2  that  is  stochastically 

12 

increasing  in  X.  If  k  ■  2  and  d  *  0,  then  the  conditions  of  Theorem  3.5 
are  satisfied.  It  follows  that  E[$(e^)|Xj  ■  x^,  X£  *  X£]  is  decreasing  in  X2» 

4.2.  Multinomial  Distribution.  Let  X | £  have  the  multinomial  distribution 
with  parameters  n  and  e_.  Then 

k  x .  k 

it(x|e)  -  h(x)  n  e*  I  (I  e.  -  1), 
i-1  i-1 


where 


k 

I(  l  -  1) 

i-1  1 


1 


ei 


1 


0  otherwise  • 


Now  e.1  is  TP,  in  (9.,  x.).  Hence  the  conditions  of  Theorem  3.3  are 

1  6  IX 

satisfied.  It  follows  that  for  any  prior,  EC ♦ ) |X^  *  x^,  ...»  Xj  *  x^t  ... 
is  increasing  in  x^  for  every  increasing  function  4. 

If  k  ■  2  and  we  use  the  conjugate  prior  for  0_,  namely  the  Dirichlet 
distribution  with  parametric  vector  a  *  (o^,  a2),  the  posterior  density  is 

2  a.+x.-l 

n(ex,  e2|x1,  x2)  -  h(xx,  x2)  e.1  l(e1  +  e2  -  l). 

Since  IC-Bj  ♦  e2  ■  1)  is  TP2  in  (e^,  e2),  then  by  Theorem  3.5, 

EC*(©X) |XA  ■  Xj,  X2  ■  x2j  is  decreasing  in  x2  for  every  increasing  function  $ 

4.3.  The  Negative  Multinomial .  Let  Xje^  have  the  negative  multinomial 
distribution  with  parameters  N  and  9_.  Then 

k  x.  k 

n(x|e)  -  h(x)  n  e.1  l(  l  e.n). 

-  i-l  1  i-l  1 

Hence  we  can  conclude,  using  Theorem  3.3,  that  El ♦ C© j ) |X^  *  x^,  . ..,  Xj  * 

Xj,  ...»  ■  x^j  is  increasing  in  x^  for  every  increasing  function  4. 

If  k  »  2  and  we  use  the  conjugate  prior  for  0^,  namely  the  Dirichlet 
distribution,  we  can  deduce  that  EC ♦  (0-^)  *  x^,  X2  *  x2j  is  decreasing  in  x 

for  every  increasing  function  4. 

4.4.  The  i4ultivariate  Hypergeometric  Distribution.  Suppose 

k  6  k 

n(x|e)  «  h(x)  n  fj)K  l  9.  »  N). 

~ j»i  xj  i-l 

e. 

Since  (•*)  is  TP,  in  (9.,  x.),  it  follows  by  Theorem  3.3  that  for  any 

J  ^  J  J 

increasing  function  4,  EC^(e^) |X^  -  x^,  ...»  X^.  -  x^,  ...»  »  x^j  is 

increasing  in  x^.  If  k  ■  2,  Theorem  3.5  applies  and  it  is  easy  to  show  that 
Theorem  2  in  Whitt  (1979)  follows  immediately. 


4.5.  Uniform  Distribution.  Suppose  T^,  T2,  . ..,  T  is  a  random  sample 

from  a  uniform  distribution  on  the  interval  (e^,  e2).  Let  X^  »  min(T^ ,  T2,  ...»  Tr), 
X2  *  raaxCTj^,  T2,  . ..,  Tn).  Suppose  also  that  we  use  the  conjugate  prior, 
namely  the  bilateral  bivariate  Pareto  distribution  [see  DeGroot,  (1970), 
pp.  172-174J.  It  is  easy  to  show  that  Theorems  3.3  and  3.5  apply  and  so  we 
conclude  that  £[*(fl^)|X^  *  x^,  X2  *  x2J  is  increasing  in  and  decreasing  in 
x2  for  every  increasing  function  4. 

4.6.  The  Exponential  Family  of  Distributions.  We  obtain  an  application 
of  a  general  nature  of  the  results  of  this  paper.  Let  X ( 6_ belong  to  the 
exponential  family,  reparametrized,  so  that: 

lt(x|e)  *  h(x)e^-  —  C(e) . 

°ixi 

Since  e  is  TP,  in  (e.,  x.),  it  follows  by  Theorem  3.3  that  for  any 
increasing  function  4,  El4(ej)|X^  «  x^,  ...,Xj  »  x^,  ...»  X^  *  x^J  is 
increasing  in  Xj . 
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